Vol, 262, No. 33, Issue of November 25, pp. 16990-16997, 1987 

Printed in U.S.A. 

Primary Structure of Human Poly(ADP-ribose) Synthetase as Deduced 
from cDNA Sequence* 

(Received for publication, May 18, 1987) 

Tomohiro KurosakiJ, Hiroshi Ushirot, Yasuhiro Mitsuuchit, Shigetaka Suzukif, Michiko Matsudai, 
Yoshiko Matsuda§, Nobuhiko Katunuma§, Kenji KangawaH, Hisayuki MatsuoH, Tadaaki Hirose||, 
Seiichi Inayama||, and Yutaka Shizutaf 

From the %Department of Medical Chemistry, Kochi Medical School, Nankoku, Kochi 781-51, Japan, the ^Department of 
Enzyme Chemistry, Institute for Enzyme Research, School of Medicine, the University of Tokushima, Tokushima 770, Japan, the 
^Department of Biochemistry, Miyazaki Medical College, Miyazaki 889-16, Japan, and the ^Pharmaceutical Institute, Keio 
University School of Medicine, Tokyo 160, Japan 



The Journal or Biological Chemistry 

<Ci 1987 by The American Society for Biochemistry and Molecular Biology, Inc. 



Human poly (ADP-ribose) synthetase consists of 
three proteolytically separable domains, the first for 
binding of DNA, the second for automodification, and 
the third for binding of the substrate, NAD (Ushiro, 
H., Yokoyama, Y., and Shizuta, Y. (1987) J. Biol 
Chem. 262, 2352-2357). We have isolated and se- 
quenced cDNA clones for the enzyme using synthesized 
oligodeoxyribonucleotide probes based on the partial 
amino acid sequence of the protein. The open reading 
frame determined encodes a protein of 1,013 amino 
acid residues with a molecular weight of 113,203. The 
deduced amino acid sequence is consistent with the 
partial amino acid sequences of tryptic or a-chymo- 
tryptic peptides and the total amino acid composition 
of the purified enzyme. The native enzyme is relatively 
hydrophilic as judged from the hydrophilicity profile 
of the total amino acid sequence. The net charge of the 
NAD binding domain is neutral but the DNA binding 
domain and the automodification domain are consid- 
erably rich in lysine residue and quite basic. The DNA 
binding domain involves a homologous repeat in the 
sequence and exhibits a sequence homology with local- 
ized regions of transforming proteins such as c-fos and 
v-fos. Furthermore, this domain contains a unique se- 
quence element which resembles the essential peptide 
sequences for nuclear location of SV40 and polyoma 
virus large T antigens. These facts suggest the possi- 
bility that the physiological function of poly(ADP-ri- 
bose) synthetase lies in its ability to bind to DNA and 
to control transformation of living eukaryotic cells like 
the cases of those oncogene products. 



Poly(ADP-ribose) synthetase, an enzyme localized in the 
nucleus of eukaryotic cells, catalyzes the polymerization of 
the ADP-ribose moiety of NAD to form a bipolymer, 
poly( ADP-ribose), which is covalently bound to various nu- 
clear proteins (1, 2). A unique feature of this enzyme is that 
it requires DNA for catalytic activity and that it is subjected 
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to automodification during the reaction (3, 4). 

Recently, the enzyme has been purified to homogeneity 
from various sources and extensively characterized (5-8). It 
has been demonstrated that the enzyme consists of three 
proteolytically separable domains, the first for binding of 
DNA, the second for automodification, and the third for 
binding of the substrate, NAD (7-12). Whereas the physio- 
logical function of this enzyme is not as yet fully understood, 
several lines of evidence suggest that it may be involved in 
many biologically important processes such as DNA repair, 
DNA replication, RNA synthesis, and cell differentiation (See 
Ref. 2). Nevertheless, how poly(ADP-ribosyl)ation partici- 
pates in these important biological mechanisms and how the 
gene for poly(ADP-ribose) synthetase is regulated in eukar- 
yotic cells remain to be elucidated. In order to provide an 
initial molecular genetic approach to investigate the physio- 
logical role of the enzyme in living cells, we have isolated 
cDNA clones representing most of a 4.9-kilobase mRNA for 
poly (ADP-ribose) synthetase in human placenta. 

In this paper, we report the isolation of cDNA clones for 
the mRNA and present the nucleotide sequence of the cloned 
cDNAs which allows us to predict the complete amino acid 
sequence of this polypeptide. Structural characteristics of the 
three functional domains as deduced from cDNA sequence as 
well as the homology of the predicted amino acid sequence of 
each domain with those of other proteins are also discussed 

EXPERIMENTAL PROCEDURES 1 
RESULTS 

cDNA and Protein Sequences — Fig. 2 shows the restriction 
map and the sequence strategy for the cloned cDNAs. The 
nucleotide sequences were determined on both strands of the 
cDNAs for all but 365 residues corresponding to the 3' end of 
the mRNA; for this region, sequence determination on both 
strands was technically difficult, but the sequence data were 
reliable. Fig. 3 shows the 3792 nucleotide sequence (excluding 
the poly(dA) tract) of the cDNA encoding human poly(ADP- 
ribose) synthetase, determined using clones pPARSl, 
pPARSll, pPARS21, pPARS3l, pPARS32, pPARS41, and 



1 Portions of this paper (including "Experimental Procedures, "part 
of "Results," Figs. 1, 3, 6, and 7, and Tables I and II) are presented 
in miniprint at the end of this paper. Miniprint is easily read with 
the aid of a standard magnifying glass. Full size photocopies are 
available from the Journal of Biological Chemistry, 9650 Rockville 
Pike, Bethesda, MD 20814. Request Document No. 87M-1665, cite 
the authors, and include a check or money order for $5.60 per set of 
photocopies. Full size photocopies are also included in the microfilm 
edition of the Journal that is available from Waverly Press. 
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Fig. 2. Strategy for sequencing cloned cDNA encoding human poly(ADP-ribose) synthetase. The 

restriction map displays only relevant restriction endonuclease sites, identified by numbers indicating the 5'- 
terminal nucleotide generated by cleavage (for the nucleotide numbers, see Fig. 3). The poly(dA) • poly(dT) tract 
and the poly(dG)-poly(dC) tails are not included in the restriction map. The protein-coding region is indicated by 
a ctosed box, the sequence used for specific priming of reverse transcription by a smatt open box, and the sequence 
used as hybridization probes for selecting clones by a hatched box. The coding region verified by the amino acid 
sequence analysis using 14 different peptides of human placental poly(ADP-ribose) synthetase is shown by a dotted 
box. The direction and extent of sequence determinations are indicated by horizontal arrows under each clone used. 
The star symbols at the end of arrows denote that sequencing was performed using deletion mutants. 



pPARS-F. No sequence difference was observed among the 
overlapped cDNA sequences determined with the individual 
clones. All of the 14 partial amino acid sequences determined 
by peptide analysis were found to be encoded by the cDNA 
sequence in the same reading frame (Figs. 1 and 2 and Table 
H). 

The translational initiation site is assigned to be the me- 
thionine codon composed of nucleotide residues 1-3 because 
this is the first ATG triplet that appears downstream of the 
nonsense codon TAG (residue —9 to —7) found in the frame. 
This assignment is supported by the fact that the nucleotide 
sequence around the ATG triplet agrees well with the favored 
sequence that flanks functional initiation codons of 
£XXAUGG, where X can be any nucleotide (30). The codon 
specifying the tryptophan residue at position 1013 is followed 
by the translational termination codon TAA. The length of 
the 3'-noncoding region of the cDNA excluding the poly(dA) 
tract was estimated to be 658 residues by sequence analyses 
using clones pPARSl and pPARS-F. 

From the cDNA sequence, we concluded that poly(ADP- 
ribose) synthetase from human placenta consists of 1,013 
amino acid residues. The molecular weight of the protein was 
calculated to be 113,203. This value is in good agreement with 
that reported for the molecular weight of the enzyme from 
human placenta (8). Furthermore, the amino acid composition 
of the enzyme as deduced from the cDNA sequence reasonably 
agrees with that experimentally determined (Table III). 

Northern Blot Analysis of the mRNA — To determine the 
size of the mRNA encoding poly(ADP-ribose) synthetase, a 
series of Northern hybridization experiments were performed 
using poly(A) + RNA from human placenta and restriction 
fragments of the cDNA inserts of clones pPARSl, pPARS21, 



pPARS31, and pPARS41. Only one size species of mRNA 
was found in all cases. The size of the mRNA for the enzyme 
was estimated to be 4.9 kilobase, based on its electrophoretic 
mobility relative to known standards (Fig. 4). 

Characteristics of the Three Domains— Judging from the 
amino acid sequence of the N terminus of the 54-kDa frag- 
ment as determined by peptide analysis, we concluded that 
the NAD binding domain consists of 489 residues starting 
from residue 525 to the C -terminal tryptophan of the enzyme 
protein. The molecular weight of 54,881 calculated from the 
predicted amino acid sequence (Table III) coincides well with 
the experimentally determined value of 54,000 for the molec- 
ular weight of the NAD binding domain. 

The exact identification of the splitting site of the enzyme 
by papain to form the 44-kDa fragment (the DNA binding 
domain) and the 72-kDa fragment was difficult because pep- 
tide analysis revealed microheterogeneity of the N -terminal 
sequence of the 72-kDa fragment It was noted, however, that 
three similar amino acid sequences (Thr-Ser-Ala-Ser-Val-Ala, 
residues 361-366, Ser-Thr-Ala-Ser-Ala-Pro, residues 371- 
376, and Ser-Ser-Ala-Ser-Ala-Asp, residues 381-386) were 
coded in the localized region near the N terminus, at which 
cleavage of peptide bond resulted in the formation of the 44- 
and the 72-kDa fragments (see Fig. 3). In fact, the N-terminal 
sequence of Ala-Ser as well as that of Ala- Pro was suggested 
by peptide analysis of the 72-kDa fragment as described 
before. Therefore, we concluded that papain cleaved the pep- 
tide bond between residues 372-373 as well as those between 
residues 362-363, residues 374-375, and residues 382-383, 
resulting in the formation of microheterogeneous fragments 
of 72 kDa. Based on this conclusion in addition to our earlier 
observations (7, 8, 11), it is reasonable to consider that the 
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Fie;. 1, Autoradiograms of blot hybridization analysis of 
human placental poly (A)* RNA with cDNA probes. 
HflmHK2358)-B«mH'l(2609) 251 -bp* fragment from pPARSl <». 
/fcal(1775M/(/Iil958) 183-bp fragment from pPARS21 (fc), 
l//^dl]lf6971-/Ccc)RI(l012) 315-hp fragment from pPARS3l (ch and 
/^oIil53)-to;l(368) 2lo-bp fragment from pPARS41(d) were used. 



DNA binding domain consists of 372 residues, starting from 
the N -terminal methionine to residue 372 with a calculated 
molecular weight of 42.018 and that the automod ill cation 
domain consists of 152 residues starting from residue 373 to 
residue 524 with a calculated molecular weight of 16.304. :t 
This interpretation gives a calculated value of 71,185 for the 
molecular weight of the 72-kDa fragment. 

On the basis of the above consideration, the amino acid 
composition predicted for each domain of human poly {ADP- 
ribose) synthetase is presented in Table III. It is noted that 
the net charge of the NAD binding domain is neutral but the 
DNA binding domain is considerably rich in lysine residue 
and quite basic, whereas the aulomodification domain is rel- 
atively basic. Fig. 5 show^s a hydrophilicity profile for the 
deduced amino acid sequence of the native enzyme. As ob- 
served in this figure, the enzyme protein appears to be rela- 
tively hydrophilic. From the predicted secondary structure, it 
is estimated that the NAD binding domain appears to he 
somewhat rich in /7-sheet in relation to other domains. 

Homology with Other Protein* -A computer search of the 

protein sequence bank (28, 29, 33) did not reveal any partic- 
ular protein with a striking overall homology to human 
poly (ADP -ribose) synthetase. Nevertheless, a separate com- 
parison of the localized sequence of each enzyme domain with 
those of other proteins indicated that. poly(ADP-rihose) syn- 
thetase has partial sequence homology with some other pro- 
teins. Of particular interest is the fact that the DNA binding 



2 The abbreviations used are: bp, base pair; kb, kilobase; HPLC, 
high performance liquid chromatography; PTH. phenylthiohydan- 
toin; SDS, sodium dodecyl sulfate; TPCK. i,-l-!;osyl-amido*2-phen- 
yl ethyl ehloro methyl ketone. 

a The molecular weight of the automodification domain is some- 
what low as compared with that determined experimentally (8). This 
may he due to the endogenous mono( ADP- ribose) or oligofADP- 
ribose) attached to the domain although direct evidence along this 
line is lacking. 
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Fig. 5. Hydrophilicity profile and predicted secondary 
structure of human poly(ADP-ribose) synthetase. The averaged 
hydrophilicity value of an undecapeptide composed of amino acid 
residues i - 5 to i ' + 5 has been plotted against i, where i represents 
amino acid number. The hydrophilicity values of individual amino 
acids have been taken from the data of Hopp and Woods (31). The 
position of the predicted structure of a-helix (open boxes) or 0-sheet 
(dotted boxes) that has a length of 10 or more residues is shown (32). 
Arrows indicate cleavage sites by papain and by a-chymotrypsin to 
form three separable enzyme domains. 

domain contains the homologous sequence with those of on- 
cogene products such as c-fos and v-/os (Fig. 6). Also, this 
domain involves a homologous repeat in the sequence (Fig. 6) 
and contains a unique sequence element (Fig. 7) which resem- 
bles the essential peptide sequences for nuclear location of 
S V40 and polyoma virus large T antigens (37-39). It is further 
noted that the NAD binding domain has a sequence similar 
to the consensus sequence for the binding of adenine nucleo- 
tide (Fig. 7) as are observed with various ATPases and ade- 
nylate kinase (40). 

DISCUSSION 

Using the clones isolated, we have determined most of the 
nucleotide sequence of the cDNA for human po!y(ADP-ri- 
bose) synthetase. A striking feature of the 5 '-untranslated 
region, although only partially determined, is the G+C rich- 
ness (71.6%) in relation to the content in the coding region 
(51.2%). The 3' -untranslated region contains the conical pol- 
yadenylation signal 17 bases upstream from the poly(dA) 
tract. From the coding sequence, we predict that the enzyme 
consists of 1,013 amino acid residues with a molecular weight 
of 113,203. This value is very close to that established exper- 
imentally (8). The amino acid composition predicted from the 
cDNA sequence also coincides well with that previously de- 
termined (8). Furthermore, the coding sequence involves all 
of the amino acid sequences of 14 different peptides as ob- 
tained by proteolytic digestion of the native enzyme. 

Comparison of the total amino acid sequence as deduced 
from the cDNA with the partial amino acid sequences at the 
N termini of the 54- and the 72-kDa fragments allowed us to 
assign the locations of the functionally different three do- 
mains of poly(ADP-ribose) synthetase. Based on the amino 
acid composition predicted for each domain (Table III), the 
DNA binding domain is rich in lysine residue and quite basic 
(18.8% basic amino acid residues, the net charge +15). Thus, 
this domain appears to easily bind to DNA by ionic interac- 
tion. In fact, the DNA binding domain from other species is 
known to be basic as judged by direct amino acid analysis 
(11). The automodification domain is also rich in lysine 
residue and basic (the net charge +7). In contrast, the NAD 
binding domain contains less lysine and arginine residues, 



and the net charge of this domain is neutral. It is noted that 
the relative content of glutamic acid residue in the auto- 
modification domain, to which poly(ADP-ribose) is attached, 
is the highest among the three domains of the enzyme protein. 

Homology of the total amino acid sequence of poly(ADP- 
ribose) synthetase with those of other proteins is not so 
striking. Nevertheless, the partial amino acid sequence of the 
enzyme somewhat resembles those of several other proteins. 
Of particular interest is the observation that the DNA binding 
domain has a homologous repeat in the sequence and exhibits 
a homology with localized regions of transforming proteins 
such as c-fos and v-fos (Fig. 6). Furthermore, this domain 
contains a unique sequence element similar to those of SV40 
and polyoma virus large T antigens which are required for 
their nuclear localization. Therefore, these facts suggest the 
possibility that the physiological function of poly(ADP-ri- 
bose) synthetase lies in its ability to bind to DNA and to 
control transformation of living cells like the cases of those 
oncogene products. Analysis of genomic DNA in normal and 
transformed eukaryotic cells using cDNA probes isolated in 
the present study will clarify the above problems. 
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Supplementary Mat«n*l to 

Primary Structure of Hunan Poly (ADP-ribose) Synthetase 
as Deduced from cDNA Sequence 
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Shigetaka Su*uki, Miehiko Hatsuda, Yoshiko Hatsuda , 
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Tadeaki Hirose, SeiichL inayama and Vutake Shituu 

EXPERIMENTAL PROCEDURES 

Haterlals —-A cDNA library constructed in pcD ID) using poly(AI*MJA from 
cultured human fibroblasts inontransformed) was kindly provided by Dr. 

H. Okayama at National Institut.es of Health. Tor transformation 
performed by the procedure of Wahl et al_i (14 1, Escherichi a cell HO 101 
was used. " ' 

Co-mereial sources at materials used were as follows: (.-"PldCTP 
(J ,000 Ci/mmoll ond |,-"p|ATP (5,000 Ci/mmol) from Amcrsham Corp.; 
oliqo(dT)-c#Hulose (Type 7} from P-L Biochem icelsi Psti-cut and 

01 igoldG) -tai led pBR33? from Beth* id* Research Laboratories! E. coli DMA 
polymerise 1. the Klenow fragment of c ol i DMA polymerase, DMA 
ligase and T4 polynucleotide kinase !rom~Toyobo Co.i E. coli DVA ligase, 
B_. coli nbonuclease H. 51 nuclease, terminal deoxyr iBonucTeot idyl trans- 
ferase, bacterial alkaline phosphatase. T4 DMA polymerase and Balll 
nuclease from Takara Shuxo Co.; reverse transcriptase of avian myelo- 
blastosis virus from Seikageku Kogyo Co. ; restriction endonucleesos from 
Takara ShusO Co. and Toyobo Co.* TPCK-trypsin from Wor thing ton i 
•-ehymotrypsin from Siqme: bacto tryptone, yeast extract and casamino 
acids from Difto. All other chemicals wore obtained from Nakarai 
Chemicals (Kyoto) and Wako Pure Chemicals Industries (Osaka) and of 
analytical grade. 

Amino Acid Sequence Analysis- --Poly I ADP-r ibose ) synthetase was purified 
to homogeneity from rresh human placenta and subjected to limited 
digestion with papain or ■.-ehymotrypsin ao described previously (8) ■ 
The proteolytic diqests were separated by DMA -eel lulose chromatography, 
followed by Bio-Gel P-$0 chromatography (8. 11 J or SOS-polyacryi amide 
gel electrophoresis (15). Electron tut ion was done using the methods of 
Hunkapillsr ot al_. (16). SDS in each band e lectroeluted from tho gels 
was removed by extraction using the method of Konigsberg and Henderson 
Ml). Tryptic digest ion was performed using 180 ug of poly IADP-rltK>*e) 
synthetase and ) uq of TPCK-trypsin in 50 mH Tris-Cl IpH 8.0) containing 

2 mM CaCl , in a final volume of 940 pi. After incubation at 17*C for 15 h. 

3 ug of TPCK-trypsin was again added and the incubation Ms continued for 
additional 13 h. The tryptic digest was directly loaded onto a column 

of Chomcosorb Ji-C- HIS x 15 mm; Chcmcol equilibrated with solvent A 
(HjO.acetom t rile; 10* trif luoroacet ic acid • 90 t 10 ; I) . A linear 
gradient of 10-601 acetomtnle was applied using solvent B (H j 0: 
acetonitr il*: 101 tr i I luoroacet ic acid - iO : *0 ■ )) over 80 min at a 
flow rate of 2 ml /rain; Hitachi 63S HPLC system with a solvent program 
was employee. The eluate absorbancc at 310 run and at 280 ntn was 
simultaneously recorded with a Hitachi 41 UV monitor. Twelve different 
peptides out of 72 absorbance peaks thus obtained were collected and 
subjected to sequence analysis with a gas-phase sequencer, model 470A, 
Applied Biosystems. PTH amino acids were analysed by HPLC at S0*C 
usinq a column of Ultra sphere ODSI4.6 x 250 no, Altexl . 

Cloning Procedures - --Table 1 represents a list of cONA libraries and 
screening probes employed for selecting the clones containing cDNA 
inserts for human poly (ADP-r ibose) synthetase. Specific ol igodeoxy ribo- 
nucleotides synthesized for preparing primer primed cONA libraries and 
the clones selected after screening are also presented in the table- 

Tor first screening of the cONA library constructed in pcO (13), 
ol igodeo*y ribonucleotide probes were radiolabeled with I ,-'*P]ATP and T4 
polynucleotide kinase. Replica f 11 tors were manipulated by the method 
Of Hanahan and neselson I IB) and baked at BD*C for 3 h. They were 
washed at 60'C for 1 h with J x 5SC containing O.lt SDS and then treated 
at 60 'C for 3 h with SO yg/ral of sonicated coif thymus denatured DMA, 
3 « SSC and 10 x Denhardt's solution. Hybridization with two sets of 
4) mer <5 x 10* cpm/nl) was performed at 55'C tor 36 h in 4 x SSC, 10 x 
Oenhardt'« solution, 150 og/ml at sonicated calf thymus denatured DHA 
and 0.05* sodium pyrophosphate. Filters were washed at 55"C for 15 min 
with 4 x SSC. next at 55*C for 10 min with 3 x SSC and further at room 
temperature for 2 min with 4 x SSC. Kyor idixat ion with two other tats 
of 17 mer 15 « 10' cpm/ml| was performed at 38'C tor 36 h in 4 x SSC. 10 x 
Denhardt's solution, 150 i,g/»l of sonicated calf thymus denatured DHA and 
0.05* sodium pyrophosphate. In this case, filters were washed three 
times at 3«*C for 15 min each with 4 x SSC. 

OUgo(dT) cDNA library from human placental mRNA was prepared as 
follows, Total SNA was extracted from fresh human placenta by ^ the 
guanidine thiocyanate method of Chirgwin et a_K (19). Poly|A)*RNA was 
enrich«d by oligo (dT) -eel lulose chromatography (20). Twenty micrograms 
of poly(A)+PNA was converted to cONA in the presence ot 50 mM Tris-Cl 
(pH 8. J), 1 mM each of Uie deoxyrlbonuclcotide triphosphateaT 50 raM KC1, 
6 mt? MgClj, 10 an di thiothre itol . 300 jg/ml of ol igo (dTI , 1 - , , and 2 units/ul 
of avian reverse transcriptase in a final volume of 100 ul. Incubation 
was performed at 42"C for SO min. The mixture was extracted with phenol- 
chloroform and the cdna formed was precipitated with 0.3 « sodium 
acetate <pH 5.5) and 2,5 volumes of ethanoL. The pcecLpitate was 
collected by centrifugation. The pallet was lyophilized and then 
dissolved in ISO l1 of double distilled water followed by the addition of 
30 ul of 1.8 N NaOH containing 50 mM EDTA. The mixture was bested at 70*C 
for 13 min and then neutralized by the addition ol 3? ul of 2 N HC) and & 
kl of IN Tris-Cl (pH 7.-SK Second strand synthesis was performed in the 
presonce~of 100 mM He pas (pH 6,9), 0.5 *« each of the deoxyrlbonuciao- 
tidc triphosphates. 70 mH KC1, 5 mH MgClT. 15 mM 3 -mercaptoethanol and 1 
gnit/jl of the Klsnow fragment of col i DMA polymerase in a final 
volume of 100 ul • Incubation was carried out at 15*C for 10 h. After 
extraction with phenol-chloroform and precipitation by et Hanoi, the cDKA 
was treated at J?*C [or 30 min with 3) nuclease at a final concentration 
of 2 units/ul in 30 mH sodium acetate <pH 4.4), 250 mM NaCl , 1 mM XnSO k in 
a final volume of 150~i.l. The mixture was again extracted with ~ 
phenol -chloroform and precipitated with 2 M ammonium acetate and 3 
volumes of sthenol . Oligo(dC) tailing of the cDNA was carried out at 
37*C for 15 min in 25 mQ Tris-Cl (pH 6.9), 200 tnM sodium cacodylat*. O.l' 
n« dCTP, 2 mH dl thiothre itol , L mH CoCli and 0.7 unit/Ml of terminal 
deoxyr ibonucleot idy I trens f erase in a final volume of lOOul. The double 
stranded cOSA thus prepared was collected and then annealed into the 
pat I site of ollqo tdC) -tailed pBR33I, For screening the clones, the 
restriction fragment as a DMA probe (Table I) was red io label led by nick 
translation with l-<-"P]dCTP 131). Hybridisation was performed at 60*C for 
18 h with 5 x 10* cpm/ml of the DNA probe in 3 x SSC, 10 x Denhardt's 
solution and 150 ug/ml of sonicated calf thymus denatured DNA. The 
filters were washed at 60*C for IS min with 3 x SSC containing O.lt SDS , 
then at «0*C for 15 min with I x SSC containing 0.U SDS and finally at 
room temperature for I min with 0.3 x SSC. 

For preparing specific ol igodeoxy r ibonucleot ids primed cDHA 
libraries, 100 pg of poly (A) *RNA was mixed with 500 pmol of a correspond- 
ing ol iqodeoxyribonuclect ide (see Table I) in 0.4 M XC1 and the solution 
was incubated at 10 *C for 60 min. Primer extension was carried out at 
42'C for SO min in 50 mM Tri»-Cl tpM 8.J1, 1 mM each of the deoxyr ibonuc- 
leotidc triphosphates, ?0 mM KC1, < mH HgCl,, To nH dithiothreitol and 

I. 200 units/ml of avian reverse transcriptase in a" final volume of 1 ml , 



After extraction with phenol -chloroform and precipitation with 2 H 
ammonium acetate and 2 volumes of ethanol, second strand synthesis was 
performed at 15*C for 1 h in 20 mM Tris-Cl IpH 7.5), 0.4 mM each of the 
deoxyr ibonucleot ide triphosphates? 100 mH KC1, 10 mH ammonium sulfate, 4 
mH «gCl/, 335 -jH 6-WAO, 50 ug/ml bovine serum albumin. <00 units/ml of E. 
coli DNA polymerase J. 90 units/ml of coli ribonucleaae K and 90 
units/ml of E. c oli PHA ligase in a 1 inal volume of 1 ml . The mixture 
waa extractedwlth'phenot -chloroform and subjected to precipitation with 
2 H ammonium acetate and 2 volumes of ethanol. The cDMA was then 
treated at J7*C for 30 min with 0.3 unit/jl of T4 DMA polymerase in 3J mM 
Tris-acotate (pH 6.9 1. 150 jM each of the deoxyr ibonucleottde 
triphosphates, (6 mH potassium acetate, 10 nH magnesium acetate, 0,5 mH 
dithiothreitol and 100 bg/rnl of bovine eeruaTalbumin in a final volume'of 
70 ul. Oligo(dC) tailing and cDNA insertion into pBR323 were performed 
by the same procedure as described above. Hybridisation with other 
corresponding DNA probes and filter willing were performed at 60*C using 
the same cocktails as those employed for screening the oligofdT) cDNA 
library. All of the cloning procedures were conducted in accordance 
with the guideline for research involving recombinant DHA molecules 
issued by the Ministry of Education, Science and C-'ture or Japan. 
01 jgodeoxyr 1 bonucleot ide Synthesis — The 17 residue and the 41 residue 
long aeoxy ribonucleotides were aynthesised by the modified triester 
method (32). They were purified by HPLC and subjected Co extensive 
dialysis prior to use- The base sequences of 41 mem were designed on 
the b»«U of the published data lot cocon usage frequencies 123), 

DNA Sequence Analysis The dideoxy sequencing method (34) using 

T^ 1 ' PldCTP was employed. Suitable restriction fragments or fraqments 
treated with Bel 3) were either subcloned into the MlJ»pU,l9 or the 
PUC19 veetorsHT?5) . 

Northern Blot Analysis The procedure of Thomas (26) was used. 

Briefly, twenty micrograms of polytA) + RNA was denatured at 50»C for 1 h 
in 1 M glyoxal and 50% dimethyl sulfoxide. The R.MA sample was 
elect rophoresed for 2 h 30 min on a 1.01 agarose gel at a constant 
voltage of 100. The SNA was transferred from the gel to a nitro- 
celluloae sheet equilibrated with 30 x SSC. After baking at BO'C tor 2 
h, the sheet was incubated at 42'C for 40 h in the hybridisation cocktail 
containing nick-translated restriction fragment (4 x )0* epn/»l , specific 
activity 1-3 x 10' cpm/ug) . SOt formamide, 5 x SSC. 50 mH sodium phosphate 
buffer (pH 6.3). 2S0 ug/ml of sonicated salmon sperm denatured OKA, 0.021 
bovine scrum albumin, 0.02* F icq 11 and 0.021 polyvinylpyrrolidone. The 
sheet was washed four times at room temperature for 5 min with 3 x SSC 
containing O.lt SDS and furthor washed twice at 50*C for 15 min with 0.1 
x SSC containing 0.11 SDS. The RNA band hybrid ited was visual tied by 
exposing the sheet to * New RXO-H film (Fuji) at -70'C for 4B h. 

Southern Blot Analysis Positive clones after screening with 

o 1 igodcox y r 1 bpnuc 1 eo 1 1 de probes were subjected to DNA blot analysis 
(27) . One microgram of DNA, digested with Hincll or Bam H I was 
elcctrophoresed on a 1.5% agarose gel at a constant voltage of 100. The 
gel was soaked twice at room temperature for 20 min in 0.3 (V tl+OH and 
0.6 M NaCl, and then incubated twice at room temperature far 30 min in 
0.2 ri Tris-C) (pH 7,4) containing 0.6 K NaCl . The DNA sample was 
transferred to a nitrocellulose sheet equilibrated with t x SSC. After 
baked at 80 *C for 2 h, the sheet was washed at 60*C for 30 min with 3 x 
SSC containing 0.1% SDS and Again washed at 60*C for 1 h with 3 x SSC , 
0.1% SDS and 10 x Denhardt's solution. It was further incubated at 60'C 
for 1 h in 6 x SSC, 10 x Denhardt's solution, 0.1% SDS , 0.1 mg/ml of 
sonicated salmon sperm denatured DNA and O.051 sodium pyrophosphate. 
Thti sheet was hybriditod at 5S*C for IS h with 2 x 10* cpm/ml of the 
"p-labelled 4 1 jner probe in 6 x SSC, 10 x Denhardt" a' solution, 0.1% SDS, 
0.1 mg/ml of sonicated salmon sperm denatured DNA and 0.05% sodium 
pyrophosphate. The sheet was washed at 55*C for is min with 6 x SSC, 
0.11 SDS and 0.21 sodium pyrophosphate. The second washinq was 
performed under the same conditions except that 0,3 x SSC was used in 
place ot 6« SSC. Autoradiography was performed at -70*C for 50 h using 
a New RXO-H film (Tujil . 

Computer Analysis Data of the nucleotide and the deduced amino acid 

sequences were analysed by the program of "Integrated Database and 
Extended Analysis System for Nucleic Acids and Proteins* (IDEAS) 
according to tho method of Ksnehise (29, 39). 



RESULTS 

Partial Amino Acid Sequence The initial approach used to clone cona 

sequences for poly (ADP-r ibose) synthetase was to determine tht? partial 
amino acid sequences of tho enzyme protein which provided the data far 
preparing ol igodeoxy ribonucleotide probes for isolating cONA clones. 
Limited proteolysis of the human placental enzyme with papain yields two 
fragments of H r - 44K and 72K, the former corresponding to the DMA binding 
domain and the latter containing both domains for autocsodi f ication and 
for binding of NAD (8, 11). Partial digestion with -.-ehymotrypsin results 
in the formation of two fragments of 54K and 62K, the former 

corresponding to the NAD binding domain and the latter containing both 
domains for binding of DNA and for automod if ica t ion (B, 111. Since oach 
proteolytic fraqment of human origin migrated a* a broad band or a 
doublet on SDS-polyacrylamide gel electrophoresis (8), it was further 
purified as described under "Experimental Procedures" and subjected to 
amino acid sequence analysis by automated Edstan degradation. 

The initial analysis of the 4IK fragment (the DNA binding domain) 
as well as the native enryma revealed no N-terminal sequence) no 
significant amount of any particular PTH amino acid was recovered 
during the cycles. This finding is consistent with earlier observations 
with the eniyme from other species (5, 7| that the N-terminui of 
poly (ADP-r ibose) synthetase is blocked and that the DNA bindng dona in is 
located in the N-terminal side of the native eniyme III), The analysis 
of the 72X fragment was also unsuccessful due to microhetcrogenelty Of 
the sample prepared although a sequence of Ala-Ser as well as that of 
Ala-Pro was suggested. 

We then analysed the $4K fragment (the NAD binding domain) and the 
4 OK fragment, the latter derived from the 54K fragment upon prolonged 
digestion with o-chymotryps in. Sequencing of the S4K fragment revealed 
12 amino acid residues of the N-terminal sequence trig. 1AI. Similarly, 
in the 40X fragment identifications were made out to 36 residues with 
three positions not definitely determined) in cycle 35, 13, and 35 both 
PTH proline and PTH glutamatc were detected (Fig, 161. The "act that 
the N-termlnal sequence of tho 54X fraqment is completely different from 
that of the 4 OX fraqment indicates that tho 40K fragment lacks the N- 
terminal portion of the S4K fragment. In order to obtain some more 
information of the partial amino arid sequences, poly (ADP-r ibose ) 
synthetase was digested with TPCK-trypsin and some of the peptides 
isolated by the KPLC column were subjected to amino acid sequence 
analysis as described under "Experimental Procedures". Table II 
summarises the results of such experiments. As shown in the table. 
12 other amino acid sequences were determined, 

isolation of cDNA Clones Based on tho amino acid sequence of the 

N-ter»inua of the 40K fragment, two sets of 41 residue long deoxyr ibo- 
nucleotides were chemically synthesized as described under "Experimental 
Procedures". Two other sets of 17 residue long mixed deoxyr ibonuc leo- 



cDNA Sequence for Poly(ADP-ribose) Synthetase 



tides which represented til possible eONA sequences corresponding to the 
partial dpi no acid sequence determined were also prepared (tee Table I). 
Using one set of the 41 residue long deoxyr ibonuc leot idea, we first 
screened the human fibroblasts cONA library (« total of x 10* 
tiansfornantsl and obtained 28 candidate clones. These candidate clones 
were further screened with three other sets o( ol igodeoxyr ibonuc leot ide 
probes and only one positive clone was obtained as judged by blot 
hybridiist ion analysis (Table I). The nucleotide sequence of the cOHA 
insert ().9fcbl of this clone (pPARS-r) contained an open reading frame 
that Included the amino acid sequence Of the N-terminua of the 4QK 
fragment as well as those of five different tryptic peptides (Tabic II), 

He then screened the oliqo(dT) cONA library from human placenta 
using the Sau3A I (28661 -EcoRl (30831 fraqnent of clone pPARS-F as a hybrld- 
i tat ion probe . Two posit ivc clones were obtained after screenings] x 10* 
transf ornants . The nucleotide sequence analysis of one clone (pPARSl) 
containing a longer cONA insert revealed that the sequence of the cONA 
insert was exactly identical with the sequence (2798-3211) of clone 
pPARS-F (fig. 1). We therefore attempted to elongate the specific oligo- 
deoxyribonucleotide corresponding to the sequence near the S' end of the 
cDKA insert of clone pPARSl by reverse transcription of human placental 
mUNA and to clone the resulting cQNA transcript into the plasmid pBR322. 
Screeninq of this cDtlA library by hybridization with the AIu I (1989)- 
Alu 1(2096) fragment derived fro* clone pPAJtS-F led to theTsolat ion of 
clone pPARSll. Trie nucleotide sequence of the cDMA insert of clone 
pPARSll was idontical with the sequence (1963-2314) of clone pPARS-F 
(Fig. 3). 

In parallel experiments, we prepared another cDNA library using 
human placental mRMA as a template in order to elongate the 17 residuo 
long mixed deoxyr ibonuc leot ides which represented all possible cDNA 
sequences corresponding to the partial amino acid sequence of the 40K 
fragment. Screening of the library by hybridisation with the 
SauJAI (178 2) -AluI(13S8) fragment of the cDNA from clone pPARS-r ted to 
the isolation of Clone pPARS21 which coded for an open reading frame 
including both amino acid sequences of the N-termini of the S4K fragment 
and the 40K fragment. Furthermore, this clone also coded for an open 
leading frame corresponding to the tryptic peptide. T-49-2 (Table in. 

To select clones harbouring cD*A sequences for a region further 
upstream, two other sets of specific primer cDNA libraries were prepared 
iTable I) and 3 clones designated Jt pPAKSJl, pPAPSJ2, and pPARSS 1 were 
obtained. The nucleotide sequence analyses in comparison with partial 
amino acid sequences of tryptic peptides (T-49-l, T-36, T-33-l, T-33-2, 
.ind T-64-1 in Table II) del ived from the purified enzyme indicate that 
their cDNA sequences involved all residual coding regions including the 
translations! initiation site of tho human placental mRNA for 
poly (ADP-r ibosol synthetase . 
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Pig. 1. S equence analysis of pept ides . . -chymotrypt ic fragments of 
poly (AOP-ribose) synthetase - ? rem human placenta were prepared and 
subjected to amino acid sequence analysis as do scribed under 
'experimental Procedures". The yields of PTM amino acids recovered from 
the S4K fragment (A) and the 4QK fraqnent (fi) at each cycle of Ed man 
degradation arc shown; one-letter amino aci3 notation it used. 
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5 ' TGCGGCTGGGTGAGCGCACGCGAGGCGGCGAGGCGGCAGCGTGTTTCTAGGTCG7GGCGTC6GGCT7CC6GAGCTTTGGCGGCAGCTACGGGAGG - 1 

1 10 20 30 40 50 

MetAlaGluSerSerAspLysLeuTyrArgValGluTyrAlaLysSerGluA^^ 

ATGGCGGAGTCTTCGGATAAGCrCTATCGAGTCGAGTACGCCAAGAGCGA 150 

60 70 80 90 100 

TrpTyr-HisPheSerCysPheTrpLysVdlG1yHlsSerlleArgHi$ProAspVaIGluVd1AspG1yPheSerGluLcuArgTrpAspAspGl nGlnLysValLysLys7hrAlaGluAlaG7yGlyVa1ThrGlylysGlytflnAsp 
TGG7ACCACT7C7CC7GCTTC7GGAAGG7GGGCCACTCCATCCGGCACCC7GACGT7GAGGTGGA7GGG7TC^^ 300 

HO 120 130 140 150 

GlylleGlySerLysAlaGlulysThrLeuGlyAspPheAlaAUGluTyrAlaLysSer^^ 

GGAATTGGTAGCAAGGCAGAGAAGiACTCTGGGTGACTTTGCAGCA&AGTA 450 

160 170 180 190 200 

LeuGlyHetlleAspArgTrpTyrHfsProGlyCysPheVaKysAsnArgGluGluleuGlyPheArgProGluTyrfc^ 

CTAGGCATGATTGACCGCT6GTACCATCCAGGCTGCTTTGTCAAGAACAGGGAGGAGCTGGGTTTCCGGCCCGAGTACAGTGCGAGTCAGCTCAAGGGCT7CAGCCTCCTTGCTACAGAGGAT 600 

210 220 230 240 250 

ClyVallysSertluGlyLysArgLysGlyAspLysValAspGlyValAspGluValAlaLysLyslysSerLysL^^ 

GGAGTCMGAGTGAAGGAAAGAGAAMGGCGATAAGGTGGATGGAGTGGATGAA^ 750 

260 270 280 290 300 

GluLeuLysLysVaKysSer7hrAsnAspleulysGluLeuLeunePheAsnLysGlnGlnValPro5erGlyGluSerAl^^ 

GAKTAAAGAAAGTGTGTTCAACTAATGACCTGAAGGAGCTACTCATCTTC 900 

310 320 330 340 350 

GlnLeuValPheLys5erAspAla7yr7yrCys7hrGlyAspVal7hrAla7rp7hrLysCystetValLysTh^ 

CAGCTGGTC7TCAAGAGCGATGCCTAT7AC7GCAC7GGGGACGTCACTGCCTGMCCAAG7GTA7GGTCAAGACACAGACACCCAACCGGAAGGAGTGGG7M 1050 

360 V 370 V V 380 V 390 400 

lyslysGl nAspArg 1 1 ePheProProGI uThrSerAl aSerVaUl aHl sProPro^ 

AAAAA&CAGGACCGTA7A7TCCCCCCAGAAACCAGCGCC7CCG7GGCCCACCC7a^ 1200 

410 420 430 440 450 

SerArgAsnLysAspG1uValLysAlaMetIleGlulysLeuGlyGlyLysLeu7hrGly7hrAlaA$nlysAla$erl^^ 

TCCCGGAACAAGGATGWGTGAAGGCCA7GAT7GAGAJWCTCGGGGGGAAGTTAACGGG6ACGGCCWCAAGGC7TCCC7GTGCA7CAGCA 1350 

460 470 480 490 500 

ArgValValSerGluAspPheleuGlnAspVa15erAlaSerThrlys$€rU^ 

CGAGTTG7G7C7GAGGACTTCC7CCAGGACG7CTCCGCCTCCACCAAGAGCCT7CA 1500 

510 520 f 530 540 550 

AlaLeuSerLysLysSerLysGlyGlnValLysGluGlu&lyMeAsnlysSerGlu^ 

GCGCTC7CCAAAAAAAGCAAGGGCCAGGTCAAGGAGGAAGGTA7CAACAM7CTGAAAAGAGA 1650 

560 570 580 590 600 

ValPheSerAla7hrteuGlyLeuValAspneValLyiGlyThrAsnSer7yr7yrL^ 

GTCTTCAGTGCCACCC77GGCC7GGTGGACATCG77AMGGAACCAACTCC7ACTACAAGC7GM^ 1800 

610 620 630 640 650 

GluGlnMetProSeriy&GluAspAlalleGluHlsPheMetlysLeuTyrGluGl^^ 

GAACAGATGCCGTCCAAGGAGGATGCCA7TGAGCACTTCATGAMTTA7AT6AAGAAAAMCCGGGAACGC7TGGCAC7CCAAAAA7TTCACGAAGTA 1950 

T 660 670 680 690 700 

ValLysLysLeuThrVaTAsnProGlyThrLysSerLysLeuProLysProValGl^ 

G7GMGMGCtGACAGTAAATCCTGGCACCAAGTCCAAGCTCCCCAA^ 2100 

710 720 730 740 750 

SerLysArgG1nI1eGlnAlaAla7yrSerI1eLeuSerG1uValGlnGlnAlaValSerG1 nGly5erSerAspSerG1nIleLeuAspLeuS«rAsnArgPhe7yrThrLeu]leProHisAspPheGlyMetLysLysProProLeu 
AGCAAMGGCAGATCCAGGCCGCA7AC7CCA7CC7CAGTGAGGTCCAGCAGGCGG7GT 2250 

760 770 780 790 800 

LeuAsnAsnAlaAspSerValGlnAlaLysValGluHeUeuAspAsnLeuLeuAspIleGtuVa^ 

C7GAACAATGCAGACAG7G7GCAGGCCAAGG7GGAAA7GCnGACAACC7GC7GGACA7CGAGGTGG^^ 2400 

810 820 830 840 850 

LysValValAspArgAspSerGluGluAlaGluI1eMeAr^lys7yrValLysAsn7hrHisA1a7hrThrHisAsnAU^ 

AAGGTGCTTGACAGAGATTCTGAAGMGCCGAGATCATCAGGAAGTATGnAAGM 2550 

860 870 880 890 900 

LysGlnLeuHisAsnArgArgleuLeuTrpHisGlySerArgThrThrAsnPh^ 

AAGCAGCT7CATAACCGAAGATTGC7GrGGCACGGG7CCAGGACCACCAAa^ 2700 

910 920 930 940 950 

SerLys5erAlaAsn7yrCysHis7hrSerGlnGlyAspProIleGlyteuI1eleuLeuGlyG1uValAlaLeuG1y^ 

7CCAAGAGTGCCAACTAC7GCCATACGTCTCAGGGAGACCCAA7AGGC77AA7CC^^ 2850 

960 9 70 980 990 1000 

Glylys7hrThrProAspPro$erAlaAsnlleSerLeuAspGlyValAspValProLeuG1yThrGlyMeS 

GGCAAAACTACCCCTGATCCTTCAGCTAACATTAGTCTGGATGGTG7AGACG7T 3000 
1010 

LeuLeuLysLeuLysPheAsnPheLysfhrSerLeuTrp 

CTGC7GAMC7GAMT7CMTT77MGACCTCCCTG7GGTAATTG«»G^ 3150 
7AAACCACC7CAGAAAGM7T77ACAGAAACG7GTTAAAGGTT77C7C7MC7TC7CA^ 3300 
MGGCTG6AGAGAGAT7C7GTTGCA7AGACTAG7CC7A7GGAAAAMCCAAGC77CGT 3450 
GATG77AAGCAT77A7T7T7AG77AAAAA7AAAMC7M77TCA7ACTA777AGA7T77CT77TT7A7CT7GCA^ 3600 
AACAA7T7CTCA7AC7TAGAAACAAAAAGAGCTTTCCT7C7CCAGGAA7ACTGAACA7GGGAGC7C77GAAA7ATG7AGTA77AAAAGT7GCA777G 3* 

Fig, 3. Nucleotide Sequence of cloned cDNA encoding human poly(AOP-Hbose) synthetase . 7he nucleotide sequence was determined with clones pPARS-F, pPARSl. 
pPARSil, pPARSZI, pPAR531, pPA~Ks3Z , and~pTA r RSTT! Nucleotide residues are nuaiered in the 5' to 3* direction, beginning with the first residue of the A7G 
triplet encoding the initiative methionine and the nucleotides on the 5' side of residue 1 are indicated by negative numbers i the number of the nucleotide 
residue at the right end of each line is given. 7he deduced amino acid sequence of poly(ADP-ribose) synthetase is shown above the nucleotide sequence and 
the amino acid residues are numbered beginning with the initiative methionine. Open triangles denote the sites cleaved by papain, and closed triangles 
indicate the sites cleaved by a-chymotryps1n to form the 54K and the 40K fragments. Residue 3,697 in the cDNA insert of clone pPARS-F is followed by a poly 
(dA) tract, which is connected with the vector ONA sequence (13). 
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rig. 7. (I) Comparison of the unique sequence el ernent of the DNA binding 
flori n wTF R the essential peptide sequences for nuclear location of Sv<5 
£p PoWoiU ~7r rut iarge~ *T antigens . {II) Comparison of the ~sequ«nc»~ o7 
2je HAD binding "domain with the consensus sequence for the bind ing of 
adenine nucleotide In the above figure. PAPS stands for " 

poly (AOP-nbose) synthetase, PMV represents polyoma viru» end X con be 
any amino acid residue. Note that two other unique sequence elements 
(Lys-Lys-Lcu-Lys-Vel-lys-Lys and Lys-Arg-Lys-Cly-Asp- Lys-Val ) which my 
specifically recogniie the DNA ligand are present in the DMA binding 
domain (residues 346-352 and residues 207-213 in rig. 3). Also, note 
that two Other sequences similar to that for the binding Of adenine 
nucleotide are present in the DNA binding domain (residues 93-106 and 
residues 201-2)6 in Fig. 3). 



